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ABSTRACT 


It is well known that the standard estimator for variance 
components in analysis of variance, Model II, of , can be 
negative with positive probability. In practice, when such 
an estimator is found to be negative it 1s taken to be zero. 
Very little is known about the properties of the correspond- 
ing truncated estimator. This thesis investigates the vari- 
ance and bias of the positive truncated estimator of - A 
method of selecting &, the number of classes, is presented 
that produces maximum power for a test of the hypothesis that 

2 


o_ = 0 while keeping the variance and bias as small as 


possible. 
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LT. INTRODUCTION 


The use of Model II or the Random Effects Model in anal- 
ysis of variance can best be described by a simple example. 
Suppose we draw a sample of 2 pieces of steel from the popula- 
tion of pieces which have been subjected to a particular 
annealing process. These 2 pieces may be considered as a 
random sample from the population composed of all such pieces 
of steel which have been or will be produced by this specific 
process. We might wish to determine the variation of flex- 
ural rigidity after the annealing process between the various 
members of the whole population. If exact measurements of 
flexural rigidity could be taken from the pieces on hand, the 
variance could be derived from straight forward statistical 
methods. However, the experimental methods used to measure 
flexural rigidity are subject to error. This error is re- 
flected in the fact that if several measurements are taken 
from one piece of steel, the results are not always exactly 
the Same. In fact, it may be the case that the measurement 
(experimental) errors are of the same or greater magnitude 
than the variation we wish to measure between the true rigid- 
ities of the different pieces. Using analysis of variance - 
Model II, it is possible to separate and isolate these two 
different causes of variation and to obtain an estimate of 
the true variation of rigidity. 

The data for such an analysis will consist of several 


different measurements of flexural rigidity taken from each 








of the £ pieces of steel. If we take r measurements on each 
of the 2 pieces, then the total number of measurements will 
be £r=N. 


The Model II analysis of variance now takes the form 


Tas me Mae ela: Bast t= 2 7? f= 12 ee (lees) 


The following assumptions and definitions are standard 


for this model: 


Ya represents the ce measurement of the cee piece 

of steel, 

wu is the “true" mean flexural rigidity of the population 
and is assumed constant, 


1 


as is the deviation from the mean of the i piece. 


The a; are assumed to be distributed N (0, ao, 
ej; is the measurement error of the sca measurement 
on the gf piece. The C45 are assumed to be 

distributed N (0, me 

as and Qi5 are assumed independent. 

For the balanced one-way classification Model II analysis 
of variance just described, it is well known that the minimum 


Variance unbiased estimator for the true variation between 


the pieces is 


52 yc ee ce (1.2) 


r(¥, - ¥)°/(2-1) 
a 


eet Rone 


where MS = 
a 


Q ite 
andes =) eigen — 7.) /li eae 
— fe je 7 
Leone and Nelson [Ref. 3] found from an empirical study 


that this estimator can be negative with probability as high 








as .40. In practical applications the estimator is taken as 


zero whenever it is negative. This then produces a trunca- 


tion of the true distribution of the minimum variance un- 


biased estimator. The truncated estimator takes the form 


A2 . 

~ > 

of 2 am eats MS. MS , (1.3) 
0 OEhnernwice: 


The properties of this truncated estimator are unknown at 


Present. 


Consider a situation where N, the total number of exper- 


iments, is fixed. Within this framework, this paper 1s con- 


cerned with an empirical investigation of the properties of 


the estimator se : 
The following questions are considered: 


1. What can be said concerning the effects of 
various choices of 2 and r cn the bias and 


; ~2 
varlance of om ? 


2. How does the variance of se compare with the 


: 2 ' 
variance of oe for a given N and 2? 


3, Can an allocation method for 2 be found to 


yield minimum variance or minimum bias for 


er, 
a 


4, If such an allocation method can be developed, 


how does it compare with the allocation formula 


for r developed by Hammersley [Ref. 1] to mini- 


Z 
fe} 
mize the variance of ge when K = > is known? 
Ss 
5. If nothing is known about K is there a "best" 


alll@ocatiLon method LOr wa 








6. If we are testing the null hypothesis Ho? 


af = 0 against Hy: o* *# 0, how does the alloca- 


tion of 2 affect the power of this test? 








II. METHODOLOGY 


Dae LHEORY 


In order to investigate the behavior of the estimator it 
1s first necessary to develop the distribution of ae and 
expressions for the bias and variance of this estimator. 


i sas Er Pout on vor ae 


Let U and V be two independent random variables with 


Pearson Type III distribution so that 


tT, +1 
Y a -Y,u 
2. — if u>o 
r(t,+1) 
IL 
0 otherwise 
tTat+l 
/ TAY TY5V 
ema L(y) = ag, 2 e 2 me y > 0 
M(t5+1) 
0 otherwise. 


Pearson [Ref. 4] found the distribution of Y = U - V to be 


tT +1 Totl 


ie ie Ge Ee To (tyth) 


as ia I aS SESE Y [ + Ii (yity.)y 
g(y) ms Tt 41 ! WEN y 


t.(tT.-L) (t.+1) (1.42) 
eee 2 ee coe] 1 Vea © 


2a 
21(¥,+Y5) “y 


(2253) 








YY ao) eee acpi) 


= 1 
ht nn CM Seer ery 
P(t5+1) (¥, +75) eee 


ty (tT, 7-1) (T3541) (7, +2) 








=F 5 5 pet |) cakic Sie [gy 
' = 
2'(y,+Y>) (-y) 
(2.1) 
He also showed that 
\ (y) a ead 
G\yY a er ior 1G oe al Taal) s (222) 
Yit7¥> 1 2 
y Totl Ty, 
yg(y)dy = —— — —— (1T,+1l, t,+1) 
Y> Y¥itY5 al: 2 
tT, +1 Ly 
= ————- (tT, +2, I.) (2) 
Yq Y1tY> I) Dis 
~(T.+1L) ~(t_+1) 
and S e"Yg(y) dy = (1- *) a (eee : 
O 2 Yo 
Ty,+t 
. er cel Ons gee oo (2.4) 
YitY> 1 2 


where I, (p,q) is the ratio of the incomplete beta function 


to the complete beta function. 


- 
= 
4 


If we choose 1 Sy 0 eatbs = ile 1 SS (2.5) 


ana T5 = 


Ny 


- i}. 


and substitute these values into 2.1, we obtain the density 
PUnetLOn Of -Y = aX, - DX,, where xX) and X. are independent 
chi-square variables with n and m degrees of freedom respec- 


tively and a and b are positive constants. This density 


function becomes 








1 
1 sy/2a_2 m 
n (arm) ™ € 3 2F ole s (1 = 7x) 7 
r(By2%a : (atb) 2 
-~2ab 
Tatbyy) ° ¥ > ° 
g(y) = 
aie 
1 =V/ Zaye n So 
ee ze 2FoI5 . (1 - >) 
r(8)2*%b * (atb)* 
2 Zab <0 
(aco )iya ¥ 
(2.6) 
where 
2Fo[pra:X] = ) (p) (a), or 
n=0 
with 
= a 7 ee ela) 
vay = a(atl) (atn-L) = 
Now let 
Y if Y > 0 
Yt = 
0 otherwise. 
The distribution for Y° will be 
Hyt ty) = % g (y) dy y = 0 (2.7) 
yt 
g(y) dy , yro=, 


0O 


10 








and using Eq. (2.6) we get the density function 


1 ean 
if b (>, x) v y = 0 
atb 
ae ayy) Dy, eee) 
0 , otherwise. 


(2.8) 


fem equations (2.3), (2.4), (2.5), and (2.8) it follows that 


hee mon, _ m n 
lg ) =na I 5 (>, >) mb I = (> + ub 7 a li (229) 
atb atb 
and 
es awe 

le Vict _ n m a Z 2 Mm 22 

E(e ) =I b (>, x) + (l-2at) ~(1+2bt) T(1+2bt) ‘oe x) - 
atb atb 
(22a) 


ier alstrioutzon OF af is the same as that of ve Wailea 


n= £-l m= &£(r-l), 


a2 + xo? 32 
a= “T(R-1Ty) and b= zr r-1 ° Ze 15) 


2 
a 


2. Variance and Bias of o 
As indicated above, the distribution of of is the same 
as the distribution of Y° eke the proper choice of n, m, a, 
and b. Thus, Eqs. (2.9) and (2.10) give the expected value 
o}e 5 and its moment generating function when of is substi- 
tuted for y and m, n, a, and b are defined as in Eq. (2.11). 


The variance of af can now be derived by recalling 


that 


el 








2 2 
var (64) = [25 - (=) | ; (2.12) 
dt pad 
=2 
where M = Bec , 


Applying successive derivitives to Eq. (2.10) and evalu- 


ating at t = 0, the expected value of of 1s found to be 
2 el s m it m n 
ey = ar! t=0 gee) (ys 7) WD Ee ep yo 
atb atb 
2a pa 2) e-em By) (2.13) 
atb ‘atb a+b VE 2 Oy : 


where ae = 1s the beta function with parameters m/2 and 


zZ 
Ay2, and | = (acn+ Poacn= 2abmnt2>entb “in ie _ (>, = 
dt~™ t=0 = 
atb 
2ab a he - b 3 - m n 
arp ‘On 5 bm * 2a © 20) orb (S45! AES ST 


( 2s ) 
Equation (2.13) can be shown to be equivalent to Eq. (2.9). 
eamacing Eq. (2.13) and subtracting from Eq. (2.14), the 


; sa ag 
Variance of oe 1s obtained as 


a 2 Z m n 
Var (o5) = 2(na” + mb") I, (ys 
atb 
m hie m n 
+ (na - mb) [na I = (>, >) - mb I = (> ae 7 1) ] 
atb a+b 
mt eel 
rs ee nm © eae mn mn 
ats Deora ce sane) b(a-b) /8 (5,5) -[na I Es (15) 
a+b 
m n 2 
-mb I _ (= + i > dls) ; 
atb 
(2.255) 


eZ 








From Eq. (2.9), the bias Of ene estimator Jsuqinvene., 


i a m Th tee. m ger paneeilaee 
bias = nal => (> 5 5) mb Iat(>+ 1, 5 1) ae 
atb 
(2216) 
The expression for the variance of of is 
2 
~2 9 of Ts ro*) oe 
Var (0_) = ~+[—35——_- + py |! ~C. C27 
a #2 r7 (2-1) X(r-1) 


Thus, values for the variance of the minimum variance un- 
biased estimator, ae can be computed for a comparison with 


the variance of ae for fixed N and 2 


3. Power 
In considering the problem of selecting an g fora 
fixed N when testing a given hypothesis based on the sample, 
the power of the test 1S an important consideration. Suppose 
the null hypothesis Ho: os = 0 is being tested against the 


o* *# 0 from a sample of 2 classes, 


alternative hypothesis Hy: 
each class consisting of r observations. From the analysis 
MS 
of variance table, the test statistic is found to be F = us 
SS. SS. i 
where MS. = al and MS = Q(r-1) - 
dite be shown that ae a 
ee a and ee are 
a2 + ro- a? 
e a e 
independent chi-square variables with%- 1 and &(r-1l) degrees 
MS. 
of freedom respectively. The statistic me May now be re- 
e 


Written as 


iPS 





7 
- - 





Ss. 
(45) / (2-1) 


i! oe + ro. 
= as d (2.18) 
(—) /% (r-1) 
0 
e 
and is a ratio of independent chi-square variables divided 


by their degrees of freedom and is distributed as 


1) ; Diane where Pig eel (eferquercetl id Vyshegalevollicy yfaliec 


a and b degrees of freedom. 


lie H is true, 1.e., of = 0 the test statistic 


SS_/ (2-1) 
S8_/U(E=1) ile Greteres JOUneers ets DBL) - Gr) 


Thus, a test of the null hypothesis consists of re- 
jecting H, ata level of significance a , if 
F 2 Fy (e-ay, Rlx-2) * 


The power of this test, denoted 8(9), is given by 


us. 2 
B (8) = Pls ? Se Me (Geel)! where 9 = = e 
Sa 


Bat if of *# 0, then 


MS 
a 
2 2 2 
i, Sisk FTO, Se Fas (2-1), 24-1) 
MS 2 2 
e 10) + ro 
ae e a 
Oo 
a 
2 
Lae we (2-1), &(r-1l) 


= P[F > et] 
(2-1), %&(r-l) 62 a ae 
e a 


14 








This expression can be evaluated after the transformation 


l 
i 


Loe 
Y> Wi ye 


The variable Y 1s distributed as BY ss Y5) F 


Thus 
= ~ ri-—— a < salt ia ae ne al 
ate L-l)o- F 
1 + ke oy Pe-1, &(x-1) Te eS a 
2(L-1) (o_ + ro7) 
e a 
or 
B8(8) = I {%-1, 2(xr=-1) ] 
where 
X = - = 
ote Gen Gr (eu aeecedD 


1 + 
2 2 
ges 
L ( 1) (Oo, 4 ey 


yields the power of the test of hypothesis Ho: of = 0, fora 


Specified a and N. 


B. DATA GENERATION 
Bee Ole GrSrsue2 eo een 2 > can 2 2 LOR man @ii(2 hoe ie Cal moe 


seen that each of the properties to be analyzed is dependent 


2 


on four variables: Oar 


ty rand &. Recall that N = gr is 
the total number of experiments to be conducted. If N is 
fixed the choice of either 2 or r determines the other. Thus, 


for fixed N, we have only three variables, 2&, ae and of . We 


,; 2. 
now wish to see what happens to the bias and variance of ie 
and the power of the specified test of hypothesis as the three 


variables take on a range of values. 


15 








Calculation of these statistics was done on an IBM 360/67 
computer using the basic program shown in Appendix A. [In 
addition to this basic program, the IBM supplied subroutines 
for computation of the beta-distribution were also used. 

The values chosen for N were 12, 16, 20, 40, 50, 80 and 
100. For each value of N, & varied through all integral 
mryisors of N such that 4 < 2% Ke. For example, for N = 80, 
& took on the values 4, 8, 16, 20, and 40. 

Initially, for each combination of N and &, both of and 
om were varied from .l to 2.0 in steps .1 and again from 1.0 
to 20.0 in steps of 1.0. Values were computed for the vari- 
ance and bias of of , the variance of of and the power of the 
specified test of hypothesis for all possible combinations of 
i ae ae of, and of in the ranges described. 

The data generated in this manner supported the conten- 
tion of Scneffe's that of is simply a scaling factor for both 
the bias and variance of af and the variance of of Figures 
1 and 2 illustrate the scaling influence of of on the bias 
and variance of af when N = 20 and & = 4 and 5. 

As for the power of the test of hypothesis, Scheffe' has 
shown 8(8) to be dependent only upon the ratio 0/02 and 
again of can be considered as a scaling factor. 

Based on these considerations, of was set at one for all 
data generated for use in this thesis. This greatly reduced 
the amount of time and output required for computer runs and 
further reduced the number of input variables to two, & and 


of for each fixed N. Further, if of = 1, the value of om 1s 


16 
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Figure 1. Graph showing the effects of the scaling 


influence of a on the variance of oe hoO@an —  20e 
The three upper curves are for 2 = 4, and the 
three lower for 2 = 5. 
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Figure 2, Graph showing the scaling effects of 
of on the bias of of for N = 20. The top curve 
in each pair is for 2 = 4, and the bottom curve 
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lpn 


also the value of the ratio of K = Since of 1s a scal- 


Q 
MN 


ong factor, all conecluclon scmavmeus dnc of = Pac eG darlin, 
valid for any other ae . The direction of change for £ixed 
oom is as fcllows: as of increases (decreases) the variance 
and bias increase (decrease) and the power of the test of the 
test of hypothesis decreases (increases). The magnitude of 
the change depends on the magnitudes of of and af , N andes 

In order to evaluate the power of the test of hypothesis, 
it was necessary to choose a level of significance, alpha. 

An alpha of .05 was used throughout this paper. 

Wang [{Ref. 8] has conducted a similar study of the bias 
and variance of several estimators, including 5° . Her study 
was restricted to the special case where Xy and Xo took on 
only even degrees of freedom and N took on values of 9, 27, 
8l and 225. There are no direct points of comparison between 
the data she generated and the data in this thesis. However, 
a very favorable comparison of computed variance and bias 


exists for value of N and & as nearly matching as is possible. 


Wang's variance and bias expressions for N = 8l, & = 9, and 
o* = 1.0 yield var (62) = .309 and bias (54) = 0 while the 


data from this thesis for N = 80, & = 10, and of = 1.0 yield 


var (54) = ,282 and bias (54) = 0, 


Ihe 








Til. PRESENTATION OF FINDINGS 


A. GENERAL OBSERVATIONS 
Before attempting to address the specific question of the 


selection of 2, several general conclusions can be drawn from 


Z 2 


the data regarding the bias of of, a 


the variance of oe and 6 
and the power of the specified test of hypothesis. 
Generally speaking, the bias of se 1s very small. As 
Shown in Table I, the bias decreases as K, the ratio of 
a /oe increases. For small N and small K, the bias is sig- 
nificant. However, if K is greater than 1.0 and N greater 
than or equal 20, the magnitude of the bias is $0 small that 
it can be neglected. For this range of N and K, the maximum 
value the bias assumed is less than one percent of the true 
value of of, Thus, oe 1s virtually unbiased in this range. 
The bias of of was found to be negative for many combina- 
tions of N, &, and K. However, for the entire range of input 
variables for which data was generated, the negative bias was 
always insignificant to the fourth decimal place. 
As shown in Table II, the variances of of and oe are very 
nearly the same except when K is small. For small values of 
K there is a significant difference between the two. However, 
this difference decreases sharply as K increases and is neg- 
ligible for K > 1.0. The difference between the variances 
is further decreased as N increases. Thus the variances of 
ae and om , appear to approach each other asymptotically as 


N and/or K increase. 


20 








FABLE I 


EFFECT OF N AND K ON THE BIAS OF af 
2 2 
o. = K, £= 4, oo eo) 

N K BIAS N K BIAS 
Le wl 0951 40 gal 2Oon5 3 
o5 -0478 .5 20037 

1.0 0270 1.0 0016 

2.0 .0129 2.0 0006 

iG ay MOED? 80 1 0045 
= roves Lg -0010 

Lae nose 1.0 .0005 

2.0 0061 2 50 0001 

20 1 /0452 100 All 0029 
<3 -O167 a5 .0004 

1.0 0081 ae .0002 

2a 0035 2.0 ,0001 


~~ 


Table II also indicates that the variance of i is always 
less than or equal the variance of 56 ;. The aneroductren of 
a small amount of bias by truncation of the estimator tends 
to reduce the variance. 

As is to be expected, the power of the test of hypothesis 
increases as N increases. In the model proposed here, power 
is also a function of 2 when N is fixed. For all values of 
Ieeested, it was Eound that if N > 16 and & < oe B(O)? = sg 2G. 

This implies that for values of N > 16 and & < = Qe 


power criterion can be ignored in the selection of &. Atten- 


tion can then be directed to minimizing variance and/or bias 


2. 


yy 








TABLE II 


DIFFERENCES IN TRUNCATED (54) AND UNTRUNCATED (5*) 
VARIANCE ESTIMATORS FOR VARIOUS VALUES 


OF N, K, AND ARBITRARY 2& 


22 


No K ke VAR (64) VAR (64) 
1. 0.1000 r 0.0648 0.1530 
12 0.5000 0.3461 0.4907 
12 1.0000 4 1.0158 i 0 
i 2.0000 4 3.3828 3.6574 
12 5.0000 4 18.5601 18.9907 
20 0.1000 5 0.0336 0.0696 
20 0.5000 5 0.2376 0.2896 
20 1.0000 5 0.7305 0.7896 
20 2.0000 5 2.4754 2.5396 
20 5.0000 5 3, 72uls 13.7896 
40 0.1000 g 0.0170 0.0282 
40 0.5000 g 0.1347 0.1425 
40 1.0000 g 0.4093 0.4139 
40 2.0000 g 1.3831 1.3854 
40. 5.0000 g 7.7275 2 FOZ 

100 0.1000 20 0.0081 0.0105 

100 0.5000 20 0.0525 0.0526 

100 1.0000 20 0.1526 0.1526 

100 2.0000 20 0.5105 0.5105 

100 5.0000 20 2.8473 2.8473 








in selecting & with the assurance of a very strong test of 


the hypothesis. 


Be LHE SELECTION OF {2FOR@ eee DN 

In the model being studied, it has been assumed that the 
number of observations of the random variable being observed 
is fixed at N. Further, it is assumed that r observations 
will be made on each of 2 classes of the observable phenom- 
enon so that gr = N. The problem now arises of how to choose 
& (or r) so as to obtain the best statistical results. The 
problem is complicated by the fact that the "best" solution 
is dependent on the desired result of the analysis. For ex- 
ample, the & that provides the most powerful test of hypothesis 
for a given N may very well produce maximum bias in our esti- 
mate of a . In the same manner, the 2 that provides minimum 
bias or minimum variance in the estimator may produce a very 
weak test of the hypothesis that of = 0. 

The selection of 2 is further complicated by the fact that 
all of the parameters of interest are dependent on kK. 


Hammersley [Ref. 1] developed an expression for r which 
2 


produces minimum variance in om , the unbiased estimator of 
of . Equating the first derivative of the expression for the 


Variance of oe to zero, Hammersley showed that the integral 
divisor of N that most nearly satisfies 


2 (seedless a2 IL. 
h Fe Gece 

produces minimum variance in af . For the range of N and & 
used in this study, the value of ry also produce minimum 


: = 
varlance for om Bs 
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The rh proposed by Hammersley has two unpleasant features. 
First, for some combinations of N and K, the power of the 
Specified test of hypothesis is very low. For example, if 
K = .5 and N = 12, the power of a test of Ho: of = 0 is only 
mzool. 

The second and perhaps more serious feature is that 
Hammersley's solution for ry requires a knowledge of the ratio 
af /o% prior to conducting the intended analysis. In an envi- 
ronment such as the flexural rigidity experiment where the 
general magnitudes of af and of would be known from previous 
experiments on similar products this requirement may not be 
serious. However, for a one-time-only experiment, or an 
evaluation of a new process this requirement may be completely 
unreasonable. 

The results of the present study indicate that power is 
maximized for small 2 while variance iS minimized when Q as- 
sumes its maximum value of N/2. But it has already been shown 
Pict power is not a major consideration for N > 16 if 24 7# =: 
It would appear then that 2 should be selected very near to 
bur not equal to N/2. 

Based on these considerations it appears that 


NL 
he = ly a 


such that 3 is an integer is the "best" choice for 2, that is, 
Ry is the next smaller integral divisor of N. As an example 


for N = 20 


_ 20 re = 
ge ee eS 


fooene Dest Clomee Lor 2. (See Table III.) 
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Table III shows a comparison of 2, (2, = bisa and £ for 
Dae ry g 


various values of N. It also shows the power, variance, and 
bias generated for each choice of ae and dy in a range of K 
values. It can be seen that dy increases with K to its 


maximum value of N/2 while he is fixed for a given N. Also, 


when a = N/2, the power of the test is small for small N. 


In fact, for N as large as 100, the power using &,, May be 


less than .9 while power for dg never falls below .9 for any 
N. AS was expected, the variance of o* using dy is consid- 


erably less than the variance acquired using hy Since &, was 


@etmaivea as the minimum variance choice of 2, 


Generally speaking, the bias of the estimator when & = Me 
is less than or equal the bias when & = dy The only excep- 
means tO this being when K = .1 and N = 80 and 100. The bias 


h 


Same OL the true value of of if K > 1.0 and less than one 


for both & = “es and & = %& is generally less than three per- 


Bewecent for K > 1.0 and N > 20. 

Again it seems that the method of selecting %& depends on 
the desired results of the original analysis. an will always 
produce minimum variance in the estimate of of but requires 
a knowledge of the ratio K = gaye . Lf K is Known and a 
minimum variance estimator is desired, this is certainly the 


best method of choosing Q. 


If a powerful test of hypothesis is desired hg gives a 
much more powerful test for most combinations of N and K than 
will a . Lf nothing is known of K, Ps gives a powerful test 
and a relatively small variance. 
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IV. CONCLUSIONS AND RECOMMENDATIONS 


A. CONCLUSIONS 

It may be concluded that with one exception, 2 is the 
"best" metnod of choosing the number of classes for Model II 
analysis of variance when N is fixed. The exception occurs 
in the case where K is known and a minimum variance estimator 


One oe 1s desired without regard to the power of the test of 


hypothesis that of = 0. In this case on appears best. The 
use of & = ie assures a very powerful test of hypothesis and 


will yield a small, but not minimum variance in the estimator. 
For most combinations of N and kK, - also produces minimum 
: me 
plas ino. 
a 
bi N > 20 and K = 1,07 the buas of 5° is so small as to 
be negligible. In such cases, the use of the truncated esti- 
meacor Of of has no significant influence on the results of 
the analysis except to cause a sma].l decrease in variance. 
As N and/or K increase, this decrease in variance appears to 


tend toward zero. 


B. RECOMMENDATIONS FOR FURTHER STUDY 

It is suggested that a similar study of variance esti- 
mators be conducted for value of N greater than 100 for the 
full range of K values studied here. Such a study might also 
investigate values of K less than the minimum value of .l 


used in this study. 
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A much more difficult task that could follow the same 
general approach would be an investigation of two-way and 
multi-way analysis in an effort to determine the best number 
of experiments for each class to provide minimum variance in 
the variance estimators and maximum power for a specified 


test of hypothesis. 
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APPENDIX A 


THIS PROGRAM IS DESIGNED TO COMPUTE THE VARIANCE, POWER 
AND BIAS FOR THE Y+ ESTIMATOR OF THE BETWEEN CLASS VARIANCE 


FOR THE BALANCED, ONE-WAY ANALYSIS OF VARIANCE, MODEL II. 


EXPLANATION OF SYMBOLS: 
VAR IS THE TRUE WITHIN CLASS VARIANCE. 
VARA IS THE TRUE BETWEEN CLASS VARIANCE. 
fens LHe NUMBER OF CLASSES. 
oe iS TRE NUMBER OF EXPERIMENTS IN EACH CLASS. 


XK IS THE RATIO OF THE BETWEEN CLASS AND WITHIN CLASS 
VARIANCES. 


N IS THE TOTAL NUMBER OF EXPERIMENTS. N=L*IR. 


wee 1S THE Y+, OR POSITIVE TRUNCATION, OF THE BSTIMATE OF 
THE BETWEEN CLASS VARIANCE. 


VARR IS THE MINIMUM VARIANCE UNBIASED ESTIMATOR OF THE BE- 
TWEEN CLASS VARIANCE. 


AMEAN IS THE EXPECTED VALUE OF Yt. 


Peveeloe lie, POWER OF THE TEST OF HYPOTHESIS THAT THE TRUE BE- 
TIWEEN CLASS VARIANCE IS ZERO. 


XC IS THE F-STATISTIC FOR ALPHA=.05 AND L-1 AND L*(IR-1) 
DEGREES OF FREEDOM, USED IN COMPUTING THE POWER. 


THE SUBROUTINE BDTR COMPUTES THE PROBABILITY THAT THE RANDOM 
VARIABLE U, DISTRIBUTED ACCORDING TO THE BETA-DISTRIBUTION 
WITH PARAMETERS A AND B, IS LESS THAN OR EQUAL TO X; 
BDTR(X,A,B) 

THE FUNCTION EYPLUS COMPUTES THE EXPECTED VALUE OF Y+. 


PoE FUNCTION VYPLUS COMPUTES THE VARIANCE OF Yt. 


30 








1002 


1001 


1000 
100 


101 
HOD 


10 


READ 105) La lRexe@ 

VAR=1. 

VARA=0. 

EE ti eLo 00, L000 , 100m 

WRITE (6,100) 

XN=L-1 

XM=L* (IR-1) 

DO 3000 I=1,20 

VARA=FLOAT (I) 

A= (VAR+FLOAT (IR) *VARA) /FLOAT (IR* (L=-1) ) 

B=VAR/FLOAT (L* IR* (IR-1) ) 

XMEAN=EYPLUS (XM, XN,A,B) 

BIAS=XMEAN-VARA 

VART=VYPLUS (XM, XN,A,B, XMEAN) 

VARR= (2. /FLOAT (IR* *2) ) * ((VAR+FLOAT (IR) *VARA) **2/ 

LFLOAT (L-1) +VAR**2/FLOAT (L* (IR-1) ) ) 

X=1./(1+XN*XC*VAR/ (XM* (VAR+FLOAT (IR) *VARA) ) ) 

CALL BDTR(X,XN,XM,P,D,IER) 

POW=P 

N=L*IR 

XK=VARA/VAR 

WRITE (6,101)N,L,XK, VARA, POW, VARR, VART, BIAS 

GO TO 1002 

CONTINUE 

FORMAT(' N i XK VARA POWER VAR (UNTRUN) 
Bras (ERUN)I 7 7") 

HORMAT(? "sis 2X72, 3F8,4,2% 9864 2X Peed Oe) 

FORMAT (12 ,14,F6.2) 

END | 


FUNCTION EYPLUS (XM, XN,A,B) 
C=A/ (A+B) 

H=XM/2. 

E=XN/2. 

CALL BDTR(C,H,E,P?,D,IER) 
BET=P 

F=H+1, 

e--1 

ie (@) Sys alle 

EYPLUS=0. 

RETURN 

CALL BDTR(C,F,G,P,D, IER) 
BET1=P 
EYPLUS=XN*A*BET-XM*B*BET1 
RETURN 

END 
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FUNCTION VYPLUS (XM, XN,A,B,XMEAN) 
IF (XMEAN) 10,5,10 
EYPLUS=0. 

RETURN 

C=XM/2. 

H=XN/2. 

E=A/ (A+B) 

F=XN *A**2+XM*B**2 

CALL BDTR(E,C,H,P,D,IER) 
BET=P 

G=XN*A-XM*B 

YM=XM/2. 

YN=XN/2. 

GANM=YM+YN 

CALL GMMMA(YM,GX,IER) 
GAM=G6x 

CALL GMMMA (YN, GX, IER) 
GAN=GX 

CALL GMMMA(GANM, GX, IER) 
COG=GAM*GAN/GX 

P=E**C 

R=(1.-E)**(H-1.) 
XK=COG*P*R*2.*B 
VYPLUS=2. *F *BET+XMEAN+2. *XK * (A-B) -XMEAN* *2 
RETURN 

END 
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